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Abstract 

A general formalization is given for asynchronous multiple access channels which admits different assumptions 
on delays. This general framework allows the analysis of so far unexplored models leading to new interesting 
capacity regions. In particular, a single letter characterization is given for the capacity region in case of 3 senders, 
2 synchronous with each other and the third not synchronous with them. 

1 Introduction 

Ahlswede [T] and Liao [11 showed that if two senders communicate synchronously over a discrete memoryless multiple 
access channel (MAC) which is characterized by a stochastic matrix W{y\xi,X2), it is possible to communicate with 
arbitrary small average probability of error if the rate pair is in the following pentagon: 

< i?i < I{Xi A Y\X2) 
< i?2 < I{X2 A Y\Xi) 
Ri+R2<I{Xi,X2AY) (1) 

for some independent input random variables Xi, X2, where P{Y = y\Xi — xi,X2 = X2) — W{y\xi,X2)- Moreover, 
the convex hull of the union of these pentagons can also be achieved via time sharing, while no rate pair outside this 
convex hull is achievable. 

The discrete memoryless asynchronous multiple access channel (AMAC) arises when the senders can not synchronize 
the starting times of their codewords, rather, there is an unknown delay between these starting times. Cover, McEliece 
and Posner [Sj showed that if the delay is bounded by 6„ depending on the codeword length n such that ^ — 5> then 
the convex closure is still achievable by a generalized time sharing method. 

Poltyrev |14| and Hui and Humblet [10] addressed models with arbitrary delays known (in pH] ) or unknown (in 
|10j ) to the receiver. For such models, the capacity region was shown to be the union of the pentagons above. Verdu 
|15| studied asynchronous channels with memory. His model slightly differs from common models: the time runs over 
a torus rather than from —00 to 00. Later, Grant, Rimoldi, Urbanke and Whiting in [7] showed that in the informed 
receiver case the union can be achieved by rate splitting and successive decoding. For the uninformed receiver case, 
however, the theory of AMAC contained gaps, which remained untouched for a long time, see Appendix A for details. 
In essence, these gaps were filled in the book of El Gamal and Kim [B] . 

This paper is an extended version of the ISIT 2011 contribution [S], originating from the authors' effort to derive the 
AMAC capacity region without gaps in the proof (the result in jH] was unknown to us at the time, as was, apparently, 
to the reviewers of [5]). More than doing that, in |5] a general formalization for AMACs was introduced, allowing 
dependence of the capacity region on the distribution of the delays, typically through the support of that distribution. 
For a particular (somewhat artificial) choice of the delay distribution, the capacity region was determined which, as 
shown by an example, could be strictly between the union and its convex closure. 

The main technical result of this paper is new compared to [5], it was merely announced in the authors' ISIT 2011 
talk. It is a single letter characterization of the capacity region for 3 senders case, where two senders are synchronized 
with each other and the third is unsynchronized with them. 

In section 2 we give the formal description of the AMAC model, where several possible definitions are given, which 
are analyzed in parallel. In Section 3 a general converse is presented. This converse is used when capacity region of 
known (in section 4) and previously unknown (in sections 5 and 6) models are derived. 

Our achievability proofs rely on the techniques of rate splitting and successive decoding developed by Grant, 
Rimoldi, Urbanke, Whiting and Rimoldi |I5]. 

*This paper has been submitted to IEEE Transactions on Information Theory. It has been presented in part at ISIT 2011, Saint 
Petersburg. L. Farkas is with the Department of Analysis, Budapest University of Technology and Economics, e-mail: IfarkasOmath. bme.hu. 
T. Koi is with the Graduate School of Mathematics and Computer Science, Budapest University of Technology and Economics, e-mail: 
koitomi@math.bme.hu. 
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2 Model of coding for the AMAC 



In this paper vectors (finite sequences) will be denoted by boldface symbols. 

A iiT-senders asynchronous discrete memoryless multiple-access channel (/C-AMAC) is defined in terms of K finite 
input alphabets A^, i € {1, 2, . . . , if}, a finite output alphabet y. and a stochastic matrix Vt^ : A*! x ^2 x • • • x Xk — > y 
describing the probability distribution of the output given the inputs. 

Definition 1. A code-book system of block-length n with rate vector R = i?2 . . . , Rk) for a given K-AMAC W 
consists of K code-books Ci, C2, . . . , Ck , where the code-book Cm of the m-th sender has 2"^™ codewords of length n 
whose symbols are from X„i- 

The system is symbol synchronized but not frame synchronized. The differences between the timing of the receiver 
and the timings of the senders are represented by a K-tuple of delays as in Definition [3j 

The senders have two-way infinite sequences of random messages, and assign codewords to their consecutive 
messages. The codewords go through the channel. The sequences of the senders' codewords and hence also the 
output symbol sequence are two-way infinite sequences. Fix the location of the 0-th output symbol. The message of 
sender m € {1,2,..., K} whose codeword affects the O'th output is denoted by Mm,o- This restricts the delays to be 
in the set {0,1, . . . ,n — 1}. Formally, we use the following definitions: 

Definition 2. For each integer j (z 'Z, and for each m G {1,2, . . . , K} let M^.j be a uniformly distributed random 
variable taking values in the set {1, 2, . . . , 2"^™}. All these random variables are independent of each other. The 
two-way infinite sequence {Mm.j,j € represents the message flow sent by the m-th sender. For each integer j € Z 
and for each m G {1, . . . , K} let X^j- denote the Mm.j-th codeword in the code-book of sender m. Let Xm.nj+i be the 
i-th symbol ofH-mj where i G {0, 1, . . . , n — 1}. 

Definition 3. For each n G Z+, let 

D(n) = {Di{n),D2{n),...,DKin)) 

be a K-tuple of random variables, not necessarily independent of each other but independent of all previously defined 
random variables, taking values in the set {0, 1, . . . , n — 1}. Dm{n) will represent the delay of sender m relative to the 
receiver's timing. The joint distribution of delays is known to the senders and the receiver. The realizations of the 
random variables Di{n), D2(n), . . . ,DK{n) are not known to the senders and, depending on the model, may be known 
or unknown to the receiver. The sequence D = {D(l), D(2), . . . , D(n), . . . } will be called the delay system. With a 
slight abuse of notation, we also write D instead ofT){n). 

Remark 1. Our definition allows arbitrary distributions for the delays for each blocklength n. Clearly, in practical 
models these distributions can not be arbitrary, but have to satisfy consistence conditions. We have chosen this general 
model since we think that any practical model can be described this way. 

Example 1 

For each n G Z+ and for each m G {1, 2, . . . , K} Dm{n) has uniform distribution on {0, 1, . . . ,n — 1} and they are 
independent. Following flO^ it is called the totally asynchronous case in the paper. 

Example 2 

Let K = 2, for each n G Z+ let Di{n), D2{n) be independent random variables uniformly distributed on the even 
numbers of {0, 1, . . . ,n — 1}. R is called the even delays case in the paper. 

Example 3 

Let K — 3, for each n G Z+ let Diin) — D2(n) be a random variable uniformly distributed on {0, 1, . . . , n — 1} and 
let Dy,{n) be a random variable independent of Di{n) and uniformly distributed on {0, 1, . . . ,n — 1}. R is called the 
partly asynchronous three senders case in the paper. 

For fixed n, the output sequence is defined as follows: 
Definition 4. Let Ynj+i be the output random variable of the channel with transition matrix W when the inputs are 

Xl,nj+i+Di(n), ^2,nj+i+D2{n), • ■ • ; ^ K,nj+i+D k {n) whcrC i G {0, 1, . . . , n — 1}. 

It is possible to define the decoder in several ways. We will consider two different definitions, which give the 
strongest version of the converse and direct parts of the coding theorems, respectively. 

Definition 5. An informed infinite decoder is defined as a function which assigns to each two way infinite output 
sequence realization {yi,l G Z} and each realization of T)(n) = {Di{n), D2{n), . . . , Dxin)) ,n G a K-tuple of 

messages {rhm.o, m G {1, 2, . . . , K}}. 

Definition 6. An uninformed L-block decoder, L G Z^", is defined as a function which assigns to each {2Ln-\-l) -tuple 
{yi, I G {—Ln, . . . , 0, . . . , Ln}} of possible output realizations a K-tuple of messages {rrim,o, m G {1, 2, . . . , K}}. 
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Figure 1: The Setting for Two Senders 



The code-books and the decoder form an n-length coding/decoding system. 

The definitions above determine the probabihty structure of the model. For each m the random variable sequence 
{Mm.j^j G is the two way infinite message flow of the m-th sender. The corresponding flow of codewords is 
{Xmj-,j e Z}. The flows of the senders, the channel transition and the delay system D, define a two way infinite 
output random variable sequence {Yj, / € Z}. In case of uninformed X-block decoder the receiver examines the output 
block Y^Ln,Y^Ln+i, ■ ■ ■ ,Yo,Yi, . . .Y^n from which estimations {Mm,0:"^ €E {1,2,...,K}} are created. In case of 
an informed infinite decoder the whole output sequence and the realizations of delays are used in the estimations 
{Mm,o, m S {1, 2, . . . , K}}. It is assumed that the same but shifted decoding procedure occurs at the output points 
{nk, k £ Z}. Hence the random variables of the estimations {Mmj, m G {1, 2, ... , K},j E Z} are also defined. See 
Fig. [1] for this model, in case K = 2. 

We will consider two different error definitions. As standard for multiple-access channels, both errors are average 
error over messages. However, our first error type also involves averaging over delays, while the second one takes 
maximum over the possible delays. 

Definition 7. The average error is the following: 

K 



lm=l J 

Definition 8. The maximal error is the following: 

Pr(*) = max Pr I U 1^^"." ^ ^^™-o| I^H = . (3) 

d(n):Pr{D(n)=d(n)}>0 ) L ' . J i v / \ / | 



Remark 2. The average error depends on the joint distribution of delays D2{n), . . . , Dx{n)), while the maximal 

error depends on the joint distribution of the delays only through its support. 

Remark 3. The two kinds of error are related very closely. If P^i*) then P" — >■ 0. On the other hand, if — >■ 
exponentially as n — > 00 and if niind(„).pj{]3(„)=(j(„)}>o Pr {D('T') = d(n)} tends to slower than exponentially then 

also P"(*) — > exponentially. 

We have defined several types of models according to the various definitions of decoder and of error. For the sake 
of brevity, the following definition is meant to define a capacity region simultaneously for all cases. 

Definition 9. Corresponding to the delay system D, the rate vector {Ri, R2, ■ ■ ■ , Rr) is achievable if for every e > 0, 
(5 > for all N € Z+ there exists a coding/decoding system with blocklength n > N with rates coordinate-wise exceeding 
(i?i — (5, i?2 — (5, • ■ • , Rk — S) and with error less than e. The set of achievable rate vectors is the capacity region of the 
K-AMAC. 
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Remark 4-- In the definition above we used the 'optimistic' definition of capacity region, rather than the more usual 
'pessimistic one', see [4|^ The reason is that in the even delays case there are differences in the performance of 
coding/ decoding systems of even and odd blocklength (see Theorem |4]) . 

Remark 5. If for some region achievability is proved in case of uninformed L-block decoder with maximal error, and 
the converse is proved in case of informed infinite decoder with average error, then for any combination of the model 
assumptions above the capacity region is equal to this region. 

Here, in case of L-block decoder, a proper choice of L is understood. In particular cases, a suitable L will be specified, 
not entering the question whether a smaller L would also do. 

Lemma 1. For either type of AMAC model, i/D and D' are two delay systems such that for some < a < 1 for all 
n G Z+ and d(n) e {0, 1, . . . , n - 1}^ 

Pr {D'(n) = d{n)} > aPr {D(n) = din)} , (4) 

then the capacity region under delay system D' is contained (perhaps strictly) in the capacity region under delay system 
D. 

Proof: Consider an arbitrary n length coding/decoding system. Then P^j^/^^^ > «-P"D(n) ^^^^ ^eT}'{n)(*) — ^eT}{n)i*) 
hold, where the lower indices indicate the underlying delay system. This proves the lemma. ■ 

Remark 6. In case of any type of decoder, if two delay systems D and D' have the same support set for each n, then 
the capacity regions corresponding to delay systems D and D' coincide in case of maximal error. Furthermore, if the 
equation Q is fulfilled by D and D' and it is also fulfilled when the roles of D and D' are reversed, then by Lemma 
[T]the capacity regions also coincide in case of average error. 



3 A general converse 

In this section a general converse theorem is proved, which depends on the delay system. In the following sections, 
this general converse is used to derive the capacity region of special cases. 
Write [i] = {1, 2, . . . , i}. For all subset S of [K] write 

Xs = (X,).es (5) 
= [K] \ S, (6) 

and for all R — (i?i, i?2, ■ • ■ , Rk) write 

Let D denote the delay vector. Let X^.i+Db denote the random vector with components Xi^i^jj^, I g B where B 
C [K] and Xj^ j is defined as in definition [2j similar notation is used where + is replaced by © which means addition 
modulo n. 

Theorem 1. For any n length coding/decoding system for a K senders AMAC W with informed infinite decoder, the 
following bounds hold for the rate vector R = (i?i, i?2, . . . , Rk) for all S C [K]: 

R{S) <l{y.S,QS>Ds AYQ\y.s^^QS>Ds..Q,'D) + en. (8) 

Here En — {R{[K]))Pl^ + ^, the random variable Q is uniformly distributed on {0, 1, . . . , n — 1} and independent of D 
and the message flows of the senders. Further, Yq is linked to the random, variables ^i.QeDi j ^2,q©D2 i • • ■ t Xx.QfsDK 
through the channel W ; formally, its conditional distribution given Q, D and Xi^q^^i^ , X2.q®_D2' ■ • ■ ' Xk,q®Dk depends 
only on the values xi, X2, ■ • . , xk of the latter random variables and is equal to W{-\xi, X2, ■ • ■ , xk)- 

Remark 7. Theorem [l] will be used for sequences of coding/decoding systems with P" — >• 0. In this case £„ also tends 
to 0. 

Proof: For the sake of clarity just the two senders special case is addressed here, the full proof of Theorem [T] can be 
found in Appendix B. In case of two senders the bounds ([s]) are: 

Rl <l{Xi^Q(SD, AYq\X2,Q(sD,,Q,Di,D2) + En (9) 
R2 <l{X2^QmD, /\Yq\Xi^Q(SD,,Q,Di,D2) + En (10) 
Rl+R2<l{Xi^Q@D,.X2,Q(SD,AYQ\Q,Di,D2)+En (11) 
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Figure 2: The random variables that play role in the bound on the sum i?i + R2 



Note that ne„ = n{Ri + R2)Pe + 1- Hence 

ne„ > H(Afi,o, Af2,o|A/i,o,M2,o) (12) 

by Fano's inequality. 

We just bound Ri + i?2- The bounds for i?i and R2 can be derived similarly (See also Appendix B). 

Take a window of the receiver consisting of + 1 n-length blocks Y^+^ — {Yq, Yi, . . . ^(Ar+i)_i}. This window 
fully covers the code-blocks Xi^i,Xi^2, ■ • ■ ,Xijv of sender 1 and X2,i,X2,2, • ■ • ,X2,Ar of sender 2, denoted by X^ 
and X^ respectively. The codewords at the sides of the output window are Xi^Oi X2,o, Xi_jv_i_i, X2,Ar+i, denote this 
quadruple by X**™ (the expression "side of the windows" is abbreviated by the index sw). Then we have 



Nn{Ri + R2) = 
= H(Mf,Mf) 
= I(Mf AMf ,M^)+ 
+ H(Mf ,M^|Mf ,M^) 
<I(Mf AMf ,M^)+ 

N 

+ ^H(Mi^„M2^,|Mi^„M2,0 



<I(Mf AMf ,M^) 



<I(Xf ,X^ A Y^+\X^'" 



+ NnSn 
Di,D2) 



Nnsn 



where (17 1 comes from (12 1 and ([18| comes from the Markov relation 

(Mf , M^) ^ (Xf , Xf ) ^ (Y^+i, X^"', A, 7^2)^ 
^(Y^+\Y=)^(Mf,Mf). 



(13) 
(14) 

(15) 

(16) 

(17) 
(18) 



(19) 



Note that in ^ the Markov relation (Xf ,Xf) ^ (Y^+\ Di, L>2) ^ (Y^+\Y'=) was assumed, which need not 
hold in general. It seems that Poltyrev [14] also made this error. 



Continuing the estimations ( 13 l-([T8|), 

Nn{Ri 



R2) < 



<I(Xf ,X™ A Y'^+\X^™,Di,2?2) 



Nne„ 



:H(Xf ,X 



- H(Xf , Xf I Y^+i , X'^"' , Di , D2 ) 
= H(Xf,Xf |i^l,i?2) + Ar^£„- 
-H(Xf,Xf|Y^+^i^l,i?2)+ 
+ H(Xf,Xf|Y^+\i^i,i?2)- 

^^+\X^'",i^l,i?2) 



(20) 
(21) 



H(Xf , X^^ I Y^ 



(22) 



^In short, in the 'optimistic' definition it is enough to show that there is a "good" coding/decoding system for a sequence of blocklength 
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+ ^X""- A Xf , Xf |Y^+\ Di, D2) 

< H(Y^+i| A, i?2) - H(Y^+i|Xf , Xf , D,,D^)+ 
+ 4nlog|A'| +Nnen 

= ll(Y'^+^\Di,D2)+An\og\X\+Nnen- 

-EE H(r„,+.|Y;'^+^"'xf , Xf , i^i, 7^2) 

j=0 1=0 
(Af+l)n-l 

< ^ H(y, |A,Z?2)+4n log 1^-1+ iVne„- 

N-l n-1 

,nj+i+D2 J Di,D2). 

j=l i=Q 



(23) 
(24) 

(25) 



(26) 



In (26 1 we dropped some negative terms (notice that j runs from 1 to — 1). Introduce the random variable 
linked to the random variables Xi^i0d ,-'^2,ieD2 by the channel W for all i E {0, 1}, where denotes the 
addition modulo n. For all j the joint distribution of (Di , D2 , -^i.w7+i+ _Di ; ^2,nj+i+ D2 1 ^nj+i) is the same as the joint 
distribution of (_Di, I?2, -'^^i.ieDi j -'^2,ie02 ' ^j)- Using this substitution, (26 1 can be further bounded from above by: 



<(A^ - 1) E H(K.| A, i^2) + 2nlog 13^1 + 4nlog 

i=0 
n-l 

- (TV - 1) ^ H(f,|Xl,,eDlX2,^eD2, A, ^^2) + NnSr, 



i=0 



n-l 



={N - 1) ^ I(^i,.©i?i , ^2,.©D2 A nDi,D2y 

i=0 

+ iVne„ + 4n log I A"! + 2n log 13^1 . 



(27) 



(28) 



Dividing by nN and introducing the random variable Q uniformly distributed on {0, 1, . . . , n — 1} and independent 
of the others we get: 



If iV ^ 00 then 



Ri + R2< 

< 2^^Xi^i^Di, X2.i®D2 A Yi\Di,D2) 

+ + (29) 

N-l 

< I(^i,Qe-Dii^2,Qe£i2 A Yq IQjDi, 152) 

, 21og|y| , 41og|A'| 
+ en + ^^ + ^^ (30) 



Ri+R2< I(Xi,Qei3i , ^2,QeD2 A Yq\Q, A, Z?2) + £«■ (31) 



Corollary 1. Under the assumptions of Theorem^the following bounds hold in the 2-senders case: 

Rl<^Xl,Q^YQ\X2^QQD,Q.D)+er, (32) 

R2<l{X2^QeDrsYQ\Xi^Q,Q,D)+en (33) 

Rl+R2<l{X^,Q,X2.QeD^YQ\Q,D)+en■ (34) 

Here Q is uniformly distributed on {0, 1, . . . , n — 1} and independent of Di, D2 and the message flows of the senders, 

Q denotes the subtraction modulo n, D = Di Q D2 is the relative delay between the two senders and Yq is linked to 
Xi^Q, X2^QQD through the channel W. 
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Proof: Expand the right sides of the equations ([9|),( 10 l,pT| as sums for the possible values of Q,Di,D2, e.g. 

KXi^Q(^Di, X2,Q(SD2 A Yq\Q, Di, D2) = 

= EEE ^ ^ di)Pr(i^2 - d2)I(Xi,,edi,^2,9ed. A (35) 

q di d2 

Substituting q' = q (B di and d = diQ d2, and renaming q' to g, the Corollary is proved. ■ 



4 Known capacity regions with a new insight 
4.1 The asynchronous one-sender model 

In this section the asynchronous model from section 2 is analyzed where there is just one sender {K — 1). This will 
provide the basics for the decoding method of the K-AMAC in general. 

In case of K = 1, W : X ^ y denotes a classical DMC. For the sake of clarity, we omit from the notations 
of Section 2 the index corresponding to the unique sender. Let {x(l),x(2), . . . ,x(A^)} denote the codewords of the 
code-book of the sender, where A4 — 2"^ is the number of codewords in the code-book of the sender. The coordinates 
of x(z) are denoted by {xo{i),xi{i), . . . ,a;„_i(«)). 

The difference between this model and the classical one is that the task of the receiver is not just decoding the 
codewords but also to find the beginning of the codewords. 

Theorem 2. For each version of the model the capacity region of the one sender asynchronous model is the same as 
that of the classical model: [0, maxp(/(p, T4^))], in case of arbitrary delay system. 

Remark 8. By the achievability proof below, beyond decoding the codewords, the receiver also finds out the delay of 
the sender, with probability exponentially close to 1. 

Proof of Theorem [![■ The converse part follows from Theorem [T] 

In order to prove the direct part it is enough to restrict attention to uninformed L-block decoder and to maximal 
error; actually L — 1 suffices. Classical random argument is used. Let p be an arbitrary distribution over the input 
alphabet X . Chose the symbols of codewords in the code-book of rate < i? < /(p, W) independently according to p. 
Let P"(x",y") be the joint distribution on x J^" induced by the n-th power of p and by the memory less channel 
W . Let be the marginal of P" on 3^". We define the decoder as follows. In order to estimate the 0-th sent message 
Afi.O; the receiver first examines the n-tuple of outputs {Y^n+i,Y-n+2, ■ ■ ■ ^Yq), then it examines the next n-tuple 
(y„„+2j ^-n+3i ■ ■ ■ ,Yi), etc. until the n-tuple {Yq^ ^i, • ■ • Yn-i). The estimate will be Mi g = s if among the examined 
n-tuples there is a unique one denoted by F", for which {{Xq{s) , Xi{s) , . . . , X„_i(s)), y") belongs to the typical set 



:= (x",y 



1 P"fx" v") 



(36) 



and also this s is unique. 

It can be assumed that Mi.o is fixed, say Mi^ = 1. It is clear from the classical channel coding theorem that 
if the decoder examines an n-tuple Y" which is the output of the whole codeword X(l), then the decoder will find 
X(l) but no other codewords jointly typical with F", with probability exponentially close to 1. Hence we only have 
to discuss the cases when the decoder examines output symbols in a window of length n, in which one part of the 
channel input symbols are coming from the 1st codeword and the other part from another codeword r. The probability 
that Mi^^i or Mi^i is equal to 1 is exponentially small. Hence it can be assumed that r 7^ 1. Furthermore, it can 
be assumed that the input window starts with the r'th codeword as the opposite case is similar. Hence, the channel 
input symbols in the examined output window can be written as (X„_;(r), . . . , X„_i(r), Xo(l), . . . , X„_/_i(l)) for 
some r e {1, . . . , M}, n > I > 0,r ^ 1. We will show that the probability of incorrectly recognizing typicality in this 
window is small. The probability, conditioned on the previously presented structure of the examined window, that 
the s'th codeword will be typical with this examined output n tuple can be written as: 



Pw<i{(^i(s),...,x„(s),ri,...,...,r„) e s^} 

p"(x"(s)) • Pr,„„rf {(Fi, . . . ,y„) = y"|(Xi(s), . . . ,X„(s)) = x"(s)} 



E 

(x"(s),y")e5f. 
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g"(y") 
9"(y")' 



< 



(x"(s),y")eS^ 

Pw {(11, . . . , y„) = y"|(Xi(s), . . . , X„(s)) - x"(s)} 
2-n(/(p,VK)-5) p"(x"(s), y") 



E 

(x"(s),y")eS: 



g"(y") 

,y„)-y"|(Xi(s),...,X„(s)) 
2-n{i{p,w)-5} F"(x"(s)|y")- 



x"(.)} 



(x"(^),y")es^ 
.,y„)=y"|(Xi(s),. 



,X„(s))=x"(s)}. 



(37) 



In the above derivation the definition of the set is used, and in the last equation P"(x"(s)|y") denotes the 
conditional probability induced by the joint distribution P". At this point we have to use the structure of the examined 
window. It is known that in this window the second part of the r-th codeword and the first part of the 1st codeword 
were sent. We should distinguish three cases: {s r, s ^ 1}, {s ^ r, s — 1}, {s — r, s ^ 1}. The first case follows 
from the fact that Pi^ond {(^i, • • ■ , Y„) = y"|(Xi(s), . . . , X„(s)) = x"(s)} is equal to Fi.ond {(^i, ■ • • , Y„) = y"}. The 
remaining two cases can be treated very similarly. For the sake of brevity we will demonstrate the case {s ^ r, s — 1} 
when the expression ( [37| is bounded from above by 

n-l 



E 

(x"(l),y)GA'"xJ^" L/i=o 



Lh=0 J Lh=l 

Sum in the following order: a;„_i(l), y„-i, a;„_2(l), yn-2 



' 2-n(I(p,W)-S) 



YlW{yH\xh^iil)) . 

a;;(l), yi, we get the following expression: 

i-i 



(38) 



E P"(x"(r)) E 



.h=0 



7-1 



n W{yh\Xn-l+rir)) 



./i=0 



^ 2-n(I(P,W)-S) 



(39) 



Indeed, the inner sum is equal to 1, because its terms may be regarded as a joint distribution of a memory less channel 
with inputs yi and outputs 0:^(1). 

In the above derivation we demonstrated that the probability that the receiver finds the s-th codeword typical 
with an output n tuple whose input symbols consist of two different codewords, can be bounded from above by 
2-"(^(p.>^)-'5), Using the union bound over all codewords and over all the n tuples examined by the decoder, gives 
that the probability of recognizing one of the codewords in a window where the inputs are from two different codewords 
is less then nA^2-"(^(P'^)~*). 

The above argument shows that the maximal error over delays of the random code is exponentially small in n. 
Hence there exists a sequence of deterministic coding-decoding systems with exponentially small maximal error over 
delays. Optimizing the distribution P, we can see that maxp I{p,W) is achievable. ■ 



Remark 9. The crucial step of this proof, the summation in equation ( 38 1 is adopted from Gray [8] . 



Remark 10. Note that using [S] a stronger result can be proved: any sequence of deterministic codes which work well 
for the classical channel coding model, can be modified to work well for the asynchronous model. Namely, if the same 
random sync sequence of length k w log^(n) is appended to each of the original codewords, then with probability 
tending to 1 it is possible to detect the sync sequence and decode the original codewords. This is how Theorem [2] was 
proved in the first version of the paper. The authors are indebted to anonymous reviewers for pointing out that the 
proof becomes somewhat simpler if no sync sequence is used. 

4.2 The totally asynchronous case 

From this point on, the paper strongly relies on the results of [7] and [T^]. Though the reader is assumed familiar with 
the concepts of successive decoding and rate splitting, the basics will be summarized below. 
Let be a channel with K senders. 
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Definition 10. The convex polytope TZ [W;p{xi, X2, • ■ • , Xk)] is the set of rate tuples R G (M+)^ such that 

R{S)<I{XsAY\Xs^),SC[K], (40) 
where Y and Xi, X2, . . . , Xj^ are connected by the channel W , and the joint distribution of Xi, X2 . . . , Xj^ is p{xi, X2 ■ ■ ■ , x^)- 
Definition 11. Let C denote the following set: 

C y TZ [W-px, xpx,x---x px^] (41) 

where the union is over all product distributions. 

Definition 12. The set of rate tuples {Ri, R2, . . . , Rk) from TZ[W;pxi x PX2 x ■•• x px,^] for which R{[K]) = 
I(X[x] AY) is the dominant face ofTZ[W;pxi ^ PX2 x ■ ■ ■ x PXk\- is denoted by D{TZ [W;pxi x px2 x ■ ■ ■ x PXk])- 

Definition 13. We say that (i?i,i?2: • ■ • ,Rk) is dominated by (i?i,i?2, • ■ • , Rr) if Ri < -^i; R2 < R2,- ■ ■ > Rk < R-K- 

It can be seerj^ that the points of D{TZ[W;pxi ^ PX2 x ••• x px^]) cannot be dominated by other points of 
TZ [W;pxi X PX2 X ■ ■ ■ X pxk]: but any point from TZ [W;pxi x px2 x ■ ■ ■ x pxk] can be dominated by a point from 
the dominant face. 

Remark 11. According to Definition[9] if (i?i, i?2, . . . , Rk) is dominated by an achievable rate vector then (i?i, R2, . . . , Rk) 
is also achievable. 

Recall from [7] the description of the vertices of D{TZ [W;pxi x px2 x ■ ■ ■ x PXk])- Let tt = (tti, 7r2, . . . , ttk) be an 
ordering of [K]. For all i e [K] let R^, be equal to I{Xyy. A F|X{7ri....,7ri_i})- For example ii K — 3, and tt — (2, 3, 1), 
then R^ = I{X2 A Y), R^ = /(X3 A Y\X2), Rl = I{Xi h Y\X2,Xj.). Then the rate vector = {Rl.Rl, ■ ■ -^Rk) 
is a vertex, and all vertices of D{TZ [W;pxi x px2 x ■ ■ ■ x PXk]) can be written in this way with appropriate tt. Note 
that the vertices R'^ need not be all distinct. 

In the Appendix of [7_ it is proved for informed L — K block decoder that in the totally asynchronous case R^ S 
D{TZ [W;pxi X PX2 X ■ ■ ■ X PXk]) can be achieved by successive decoding with ordering tt. We summarize the proof for 
j^{i,2,...,if}^ The coding/decoding system is randomly constructed the following way. The symbols of the code-books 
of the senders axe chosen independently according to the appropriate input distributions. The receiver first decodes 
by joint typicality the codewords of the first sender, considering the random codewords of the other senders as noise. 
This means that the receiver behaves as if there were only one sender and the channel was the following: 

W\y\xi)= ^ ■■■ Px2i^2)pxA^3)---PxKixK)W{y\xi,x2,...,XK). (42) 

X2€X2X3eX3 X^eXK 

Next the receiver decodes the codewords of the second sender by joint typicality using the already decoded codewords 
of the first sender, considering the other senders as noise. This means that the receiver behaves as it would in a one 
sender model when the channel was the following: 

W^iy, xi\x2) ^ ^ X! '■■ X! PxAxi)PX3ix3)---PXK{^K)W{y\xi,X2,-.-,XK)- (43) 

XseX'j X4,£X3 Xk€XK 

The codewords of the other senders are decoded similarly. In the final decoding step the receiver decodes the codewords 
of the iiT'th sender by joint typicality using the already decoded codewords of all the other senders. This means that 
the receiver behaves as it would in a one sender model when the channel was the following: 

W^{y, xi,X2, . . .,xk-i\xk) = pxi{xi)px2{x2) ■ ■ ■ PXK-i{^K~i)W{y\xi, X2, ■ . . ,Xk)- (44) 

More detail can be found in the Appendix of [7]. 

Now recall the notion of individual split from [7]. A split of sender i with input distribution px^ on Xi = 
{0, 1, . . . , — 1} results in two virtual senders ia, ib with distributions pxi^ and pxi^i also on Xi, explicitly determined 
by pxi and a splitting parameter, such that the splitting function /(xa, xi,) = raax{xa, Xb) maps pxia ^ PXi^ into px^ . 

Section 2 of [7j shows in the totally asynchronous case that each R e D{TZ \W\pxi x px2 x ■ ■ ■ x PXk\) can be 
achieved with Rate Splitting via at most K —1 splittj^ This means that a good code for W with rate vector R can be 
obtained from a code with successive decoding for an auxiliary channel with 2K — 1 virtual senders constructed by 
splitting the original senders, perhaps some of them split repeatedly and others not at all; the rate vector of this code 

equals the vertex R''^ of the dominant face of TZ W^]p-^_^ x pj^^ x ■ ■ ■ x Px^^ ^ for some ordering tt and distributions 

Pxi ^Px2 ^ ' ' ' ^Px2K 1 ■ particular, the i'ih coordinate of R is the sum of those coordinates of R''^ that correspond 
to the virtual senders into which the i'th sender has been split, i — 1,2, . . . , K. 



[Tj states it as a consequence of the fact that TZ [W; pxi X PX2 X ■ ■ ■ X pjfj(^] is a polymatroid, which was observed in |9], |13| . 
''The stronger result of Section 3 of [7] is not necessary in this paper. 
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Figure 3: Time sharing when the relative delay is uniform on even numbers 

Theorem 3. In the totally asynchronous case (Example^, for each model version the capacity region is C. 

Proof: In the converse part it is enough to treat the case of an informed infinite decoder and average error. The right 
side of eq. ([8| can be bounded from above as follows. 

l{^s,Q(BDs A fglXs.^Qe^sc , Q, D) = (45) 

= H(fQ|Xsc,QeDsc , Q, D) - H(fQ|X[^],QeD[,j , Q, D) (46) 

= H(yQ|Xsc,Qei5.c , Q, D) - H(yQ|X[^],Qei?,.j ) (47) 

< H(yQ|Xse,QeD.c) - H(yQ|X[A'],Qei5[.,) (48) 

= I(Xs,QeDs A YQ\y.s^,Q(BDsc ) (49) 



where (47 1 comes from the fact that the output depends only on the input variables. 

From the fact that the delays are independent and uniform it follows that the random variables {Q (B Di,i E [K]} 
are independent, hence the random variables {Xi^Q^D. , i G [K]} are also independent. On account of this, the converse 
statement follows from Theorem [T] 

The achievability part needs one modification of the proof in the Appendix of [7_ of the assertion that C is achievable 
with informed L = 2K — 1-block decoder, considering maximal error. In order to get rid of the assumption that the 
delays are known to the receiver, it is enough to use the synchronization method from Subsection |4. 1 1 in the successive 
steps of achievability of vertices. ■ 

Remark 12. In case of two senders Corollary[T|leads to a stronger result. If the relative delay D = D1QD2 is uniformly 
distributed on the set {0, 1, . . . , n — 1} then, for each model version, the capacity region is C. 

5 Even delays 

In [B^ an artificial but interesting (from theoretical point of view) model is mentioned as open problem: the possible 
delays are in the set {0, 1, ... , an} for some a E (0, 1). In this section, though this problem is not solved, a similar 
artificial model is analyzed which also has theoretical interest. 

Theorem 4. In the even delays case (Example^, for each version of the model the capacity region consists of those 
rate pairs that either belong to C or are linear combinations with weights ^, ^ of points in C . Moreover, using 
coding /decoding systems of odd length, only C can be achieved. 

Proof: In order to prove the direct part it is enough to restrict attention to uninformed L-block decoder and to 
maximal error. Let n be even. Then the senders can do time sharing with weights i, ^ using separately the even and 
the odd symbols and using the coding/decoding method of Theorem [s] Figure [s] demonstrates this fact. Note that in 
this case L can be chosen as 3. 

In the converse part it is enough to treat the case of an informed infinite decoder and average error. The proof 
uses Corollary [1] 

In case of coding/ decoding systems of even length the relative delay is uniformly distributed on the even numbers 
in {0, 1, . . . , n — 1}. Write the upper bounds in Corollary [l] as a sum for the possible values of Q and define two 
random variables Qi, Q2 as uniform on even/odd numbers and independent of each other and everything else. Then 
the following can be written: 

Ri<l{Xi^QhYQ\X2,QeD.Q.D) + en (50) 

^ n— 1 

= -y2l{Xl^,^%\X2,^eD,D)+en (51) 



i=0 



'2n 

i^odd 

+ V l{Xi,, ^%\X2,^eD) + (52) 
+ \^X^^Q^^YQ,\X2,Q,eD)+en■ (53) 
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0.2 0,4 0.6 0.8 1.0 

(a) Capacity region in the totally asyn- 
chronous case 



0.2 0.4 0.6 0.8 1.0 

(b) Capacity region in the even delays case 



Figure 4: Change of the capacity region with the change in the distribution of the delays 



Similarly we get 



R2 <^l{X2,Q,eD ^YQ^X^,Q^)+ 



(54) 



Ri + i?2 <2 I(-'^i,Qi'^2,Qieci ^ ^Qi)+ 



(55) 



where Xi ,jr2^Qje£) and Xi q^,X2^q2qd ^^re independent. This proves the converse result for even blocklength (see 
|1] Lemma 14.4+ , or its generalization, Lemma |2] in Section 6 of this paper). 

In the subsequent part of this proof the symbol n denotes odd integer. Now we prove that with coding/decoding 
systems of odd length, just the union of the pentagons can be achieved. Given such sequence of coding/decoding 
systems, where P" — >■ 0, let c„ be a sequence with c„ — and > such that c„n is integer. 

Recall that the delays Di{n) and D2{n) are independent and uniformly distributed random variables on the set 
{0, 2, . . . , n — 1}. For all i G {0, 1, . . . , n — 1} let K{i) be the number of those pairs di,d2 € {0, 2, . . . , n — 1} for which 
the relative delay d ~ di Q d2 is equal to i, then: 



2 

i+1 
2 



if i is even 
if i is odd. 



Let Di{n) and D'2{n) be two random variables taking values in the set {0, 2, . . . , n — 1} with the following joint dis- 
tribution. For each di,d2 £ {0, 2, . . . , n — 1}, if cii G(i2 € {c„n, c„n -1-1, . . . , n — 1 — c„n} let Pr {D[{n) = di, D'2{n) ^2} 



be equal to 
large enough: 



n(l-2c„)K(died2) 



otherwise 0. Then for each di,d2 G {0,2, . 



1} the following bound holds if n is 



Vr{Di{n)^di,D2{n) = d2} > 



(n+l)2 

> c„Pt {D[in) ^ du D'^in) ^ d2} . (56) 

Using the same idea as in the proof of Lemma [l] and the fact that ^ >■ we can conclude that the given sequence 

of coding/decoding systems has average error also tending to under the delay system D' described by the random 
variables D'i{n) and D2{n). Hence if we show that under delay system D' only C can be achieved, the assertion is 
proved. 

Under delay system D' the relative delay D'{n) — D[{n) Q D'2{n) is uniformly distributed on the set {c„n,c„n -|- 
1, . . . , 71 — 1 — c„n}. By Corollary [1] the following bounds hold for the rates: 

Ri < K^i Q A YQ\X2.QeD', 
R2 < l{X2,QeD' ^YQ\X^,Q,Q,D') + e^ 
Ri+R2< l{Xi,Q,X2,QeD' A Yq\Q, D') + £„. (57) 
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Let D{n) be a random variable uniformly distributed on the set {0,1, ... ,n— 1}. As the variation distance between 
the product joint distributions of {Q, D') and {Q, D) tends to 0, the following differences also tend to as n — > oo: 

I(^i,Q, ^2.Qei5' A Yq\Q, D') - ^YQ\Q,D), 
I(Xi,Q A yQ|X2,QeD', Q, D') - A Yq\X2^qqd,Q, D), 

I(X2,QeD' A Yq\X^,q, Q, D') - liX^.QQD A ^qI^i.q, Q, D). (58) 



Taking into account that Xi q and X2 qqd are independent, the assertion is proved (See also Remark 12 1 



Example 4 

There are two well-known examples (J^, J^) which show that the convex hull operation can be useful. Here we 
use 0/. Let the channel be defined by Xi = X2 ^ y = {0,1}, W{0\0,0) = 1, T4^(l|l,0) = Vl/(1|0, 1) = 1 and 
W{1\1,1) = VF(0|1,1) = i. The capacity regions in the totally asynchronous and in the even delays case are shown 
on Figure \Ja\ and on Figure [T&l In the latter case a hill appears in the middle of the picture. 

Remark 13. Remarkably, it does not seem possible to extend this result for uniform relative delay distributions on 
{0, 1, . . . , ri/2{, although this distribution has the same entropy as the uniform distribution on even (or odd) numbers. 
Similar results can be achieved if the distribution is uniform on numbers which are divisible by 3. In this case time 
sharing with weights |, | becomes possible. 

Remark I4. This also means that if the senders of the totally asynchronous AMAC want to time share with weights 
(^, ^), they can do that if a one-shot 1-bit side-information about the delays is available to the senders. 

6 Partly asynchronous three-senders case 

In this section we will prove coding theorem in case of K ~ 3, when Di{n) = D2{n) and D^{n) are independent and 
uniformly distributed on the set {0,1, . . . ,n ~ 1}. 

Theorem 5. In the partially asynchronous three senders case (Example^, for each version of the model the capacity 
region is 

U Conv U nlW; px, x px, x ^^3] • (59) 

PX3 \PXi Xpx2 / 

In words, it consists of the convex combination of rate triples from C whose corresponding convex polytopes are defined 
by the same third distribution. 

Remark 15. Using the Caratheodory-Frenchel Theorem (e.g. Chapter 15 of [4) in Theorem[5j it suffices to take convex 
combinations involving at most three rate triples. 

Proof of the converse part in Theorem^ It is enough to treat the case of an informed infinite decoder and average 
error. Theorem [T] can be used as follows. 

If 5 C {1, 2, 3} then the following bound holds: 

n ^ 

R{S) <^-I(Xs,,ei3s AF,|Xsc,,eDso,D)+e„. (60) 
Summing over the possible values of Di ~ D2 we get the following bounds: 

i?i<VV--- 

^ ^ nn 

i=0 d=0 



1(^1, ierf A Yi\X2,i(Sdi ^3,1^03 , D3) + £ri 

1 1 

n n 



■>3 

n—1 n—1 



• I(-'^2,jed A Yi\Xi^i(Qd, X^^i^D^ , D^) -f £„ 

n—1 n—1 



^3<EE'' 



71 n 

i=0 d=0 



1(^3, ieDs A Yi\Xi^i(^d, X2^i^d, D3) + £ri 
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i?2 + i?3 < 



n — 1 n — 1 ^ 

EE--- 

■ K^l,iS)d, ^2,1^x1 A li 1X3^^01)3 , -D3) + £„ 
n — 1 n — 1 

EE--- 

i=0 d=0 

1 1 

n n 



71 — 1 71 — 1 



i?i+i?3<EE 

i=0 d=0 



Ri + R2 + R3< 



1 1 



1=0 d=0 

• K^lAiSd, X2,i(Sd, X^^i^D^ A FjIDs) + £„. (61) 

Note that, X^ i^jj^ is independent of the other variables, and has the same distribution for all i. Note also that 
the above inequalities can be overestimated by dropping from the condition (same argument as in Theorem |3| . 
Hence the converse part follows from Lemma |2] below. ■ 

The achievability part in Theorem [5] is proved later in this section. 



Lemma 2. Given n sets TZ 



«e{i,2. 



W;pxi X pxi^ X px. 

with weights cn of n vectors from these sets if and only if they are contained in TZ{ai , 02, . . . , a. 
the following inequalities: 



, n}, a vector (i?i, i?2, R3) equals a convex combination 

which is defined by 



< Ri <J2c^^HXl ^Y'\XIX'^) 



1=1 



i=l 



0<i?2 < > ;a,/(X^Ar^|Xi,X^) 
aJ{X',AY'\Xl,Xi) 

i?2 <^«»/(xj,x^Ar*|x^) 

i=l 
n 

R3<Y.aJ{Xi,X^,AY'\X'^) 

n 

R3<Y.aa{XlXlAY^\Xl) 

n 

Ri+R2 + R3<Y,aJ(XlXl,XlAY'). 



< i?3 < 

Ri 
Ri 

R2 



(62) 



Proof: This proof follows the proof of Lemma 14.4+ in [4 . The sets TZ W; Px^ x pxi x and the set Tl{ai , a2, . . . , a„) 
satisfying the above conditions are convex polytopes with 16 vertices. Using the fact that the mutual and the condi- 
tional mutual information are always non-negative, it can be easily derived that there are no redundant inequalities 



between the defining equations of the sets TZ 



W;px' X px^ X pxi 



and TZ{ai, a2, • ■ • , ctn)- This means for example 



that it is not possible that the sum of the bounds for Ri + R2 and R3 is strictly less then the bound for Ri + R2 + Rs- 



Using this fact the vertices of TZ W;pxi 'x Pxi ^ Pxi can be written the following way. First = (0,0,0) is a 
vertex. The remaining 15 vertices can be divided into three groups of equal size. The first group consists of those 
vertices (Ri, R2, R3) for which i?i is equal to its own bound, i.e, of the vertices: 

,1 



^{I{XIAY'^\XIXI),0,0) 
v2 = iI{Xl AY^\XlX^,),IiX'^ AY^\X10) 
rl = {I{X{ A y'|X^,X^),/(X^ A Y'\XII{XI A Y') 
= {I{X\ A Y^\XIXI), 0, 1(Xl A r'|x^) 
\ = {I{Xl A Y'\XlXl),I{Xl A Y'),I{Xl A r'|X^) 



(63) 
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The two other groups (v^, vj, . . . , v,^") and {vj^jvl 



Vj"") are obtained similarly. 



Note that TZ 



distinct. The vertices of TZ{ai,a2, 



can be degenerate in the sense that these sixteen vertices need not be all 
a„) are the points X^ILi'^^^i' ^ — J — 15. As these vertices are contained 



W;px' X _pxi X px^ 



therefore 



in the (convex) set of convex combinations with weights ai of vectors in the sets TZ 
whole TZ{ai, a2, ■ ■ ■ , an) is contained. The reverse inclusion is obvious. ■ 

By Definition 12 the points {Ri, R2, R3) of D{TZ[W;pxi x px2 x PX3]) satisfy the inequalities in (40l, with Ri + 
R2 + = I{Xi,X2,X3 A Y). An edge of this dominant face is characterized by another inequality in (40 1 fulfilled 



with equality. The set S corresponding to that inequality will be called the type of this edge. 

The following lemma states that rate triples lying on edges with a fixed type behave similarly in context of rate 
splitting and successive decoding. It can be considered as a remark to the general theory of j7] , |T5] in the special case 
K = i. 

Lemma 3. For every fixed nonempty S C {1, 2, 3} there exists a ^.-senders channel W' derived from W by splitting the 
first or the second sender, and an ordering vr = (tti, 7r2, tts, 774) of the 4 senders with the following property. IfW is de- 
rived from W by splitting the first sender, then to any input distributions px^tPx^^Px^ ofW and for all (Ri, R2, R3) € 
D(TZ [W;pxi X PX2 X PXg]) lying on the edge of type S, there exist input distributions px^^tPXi,, md non-negative num- 
bers Ria,Rib with Ria + -Rib — Ri such that {Ria, Ru, R21 R3) is the vertex of D{TZ [W';pxia x PXi,, x px2 x PX3]) 
described by ordering tt. // W' is derived from W by splitting the second sender, then to any input distributions 
PXi,PX2iPX3 ofW and for all {Ri, R2, R3) € D{TZ [W;pxi x px2 x PX3]) lying on the edge of type S, there exist input 
distributions Px2aTPX2b '^'^'^ non-negative numbers R2aiR2b with /?2a + -^26 = R2 such that (Ri, R2a, R2bT R3) is the 
vertex of D(TZ[W';pxi x Px2a x PX2b x PX3]) described by ordering tt. 

Proof: Assume for example that a rate triple {Ri, R2, R3) G D{TZ[W;pxi ^ PX2 ^ PX3]) lies on the edge of type 
S = {1,3}, hence i?i +i?2 + -R3 = I{Xi, X2, X3 AY), i?i +i?3 I{Xi, X3 AY\X2). The other cases are similar. Then 



i?2 = I{X2 A Y) and (.Ri, i?3) hes on D{n 



W;px^ X PX3 



where W{y, X2\xi, X3) ^ Px2{x2)W{y\xi,X2,X3) (see HI], 

beginning of section 3c) . Denote by W' the three senders channel derived from W by splitting the first sender. We 
could split the third sender instead of the first sender, but we want to leave the third sender unsplit. Using the basic 
rate splitting result of [7] for two senders channels, there exist input distributions Pxi^tPXu, ^^nd non-negative numbers 

-Ria, -Rib with i?ia-|-i?if, = i?i such that (i?ia, i?3) is the vertex of £>(??. W';pxia x PXi^ x ) described by the 

ordering (la, 3, 16). Hence, {Ria, Ru, R2, R3) is the vertex of DiJZ\W';px^^ x px^^ x x PX3]) described by the 
ordering (2, la, 3, 16), where W is the 4-senders channel derived from W by splitting the first sender. This argument 
shows that for S — {1, 3}, the channel W' derived from W by splitting the first sender, and the ordering (2, la, 3, 16) 
on the senders of W fulfill the requirements of this lemma. ■ 

The next lemma shows that each x which is not in C but can be written as the convex combination of rate triples 
from C, can be dominated by a convex combination of rate triples from C which lie on edges of same type. 



Lemma 4. Given k sets TZ 



W;px' X px^ X px^ 



, i G {1,2, . . . ,fc}, if a vector x is not in C, but can be written 



X — X]j=i Q^i^i; where G TZ 



W; Pxi X Pxi, X Pxi 

k 



, < < 1, « G {1, 2, . . . , fc}, X]i=i '^i ~ 1' then x can be 
dominated by an x' which can be written as X]i=i "^i^'i; where x^ G D{TZ W;pxi ^ Px^ ^ Px^ )? < < 1,* G 
{1, 2, . . . , k}, X^iLi Q^'i ^ 1 '^nd the vectors x'^, i G {1, 2, . . . , fc}, lie on edges of same type. 



Proof: It can be assumed that at > for all i. If x^ is not on D{TZ W;px^^ x pxi x px^ ) then we can take a 
dominating x^ from the dominant face, for all i. Then x — X^iLi Q^j^j dominates x. So it can be assumed that the 



rate triple x; is on D(TZ 



W;pxi X pxi X px^^ 



for all i. 



The dominant face of a set TZ 



W;px^ xpxi xpxj 



on a plane with normal vector (1, 1, 1). We say 



is a hexagor 

that the height of the plane with normal vector (1, 1, 1) is a if its equation is x-\ry+z = a. The height of a dominant face 
is the height of its plane. As in Lemma[2]let us consider the set TZ{ai,a2, . . . , a^). This is the set of convex combinations 



with weights ai,l < i < k of the sets TZ 



W;px] X pxi X pxi 



,1 < i < k. The dominant face T>{ai, 02, . . . , ak) of 



TZ{ai,a2, ■ . ■ , Oik) consists of those points (-Ri, -R2, -R3) for which -Ri + -R2 + -R3 = J2i=i otiI{Xl,X2, X^ A F*). Note 
that X G T>{ai, a2, ■ . ■ ,ctk) because the points Xj are on the dominant face of TZ W;pxi x px^ x px^ respectively. 



Any edge of T>{ai, a2, • • ■ , a/c) consists of those points for which one of the inequalities (62 1 is fulfilled with equality. 
Hence the edges of T>{ai, 02, . . . , 0;^) consist of points which are convex combinations with weights ai, a2i ■ • ■ : Q^fc of 
points lying on edges of same type. If x is on an edge of T>{ai,a2, . ■ . ,ak) then we proved the assertion. Hence 
it can be assumed that x is an inner point of T>{ai, a2, ■ ■ . , ctk)- Suppose first that there exists m,l such that 



*The hexagon can be degenerated since some vertices can be identical 
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Figure 5: The set of convex combination of two dominant faces. One of them is degenerate (triangle). 



/(Xf A y™) > I{X[,X^^,Xl A y')- Let us define new weights: If i ^ m,i ^ I then let a[ = a,., and let 

Q!,„ — am + £, Of; = ai — e. Then the height of ^{ai, a^, ■ ■ ■ , a^) is larger than the height of I?(ai, Q!2, ■ . ■ , ctk)- If 
e is small then one of the points of 'D{ai,a2, ■ ■ ■ will dominate x. We increase e until this property holds or 
until becomes 0. Then, using continuity, an edge point of I?(ckxi ^^21 ■ ■ ■ 1 Q^fe) '^i^^ dominate x or o?; = holds. This 
argument shows that it is enough to restrict attention to the case when I{X™, X™, A Y"^) ~ I{X[,X2, Xj^ AY'') 



are in the same plane. Using 



for all 771, 1. This means that the dominant faces of sets TZ 

again the continuous change of 'D{ai, a2, ■ . • , ctk)'- if c^i ^ 1, and Oj — >■ if j ^ i, then I?(ai, a2, . . . , ak) tends to 

D{TZ W;pxi X px| x ). As x is not in C, it is not in D{71 W;pxi x pxi x pxi ) for either i £ {1,2,..., k}, 

hence there are weights a^, ■ ■ ■ , (xl, for which x is on an edge of 'D{a\, ... , a^,). So it is a convex combination of 
points lying on edges of same type. ■ 

The proof of achievability in Theorem [5[' In order to prove the direct part it is enough to restrict attention to 
uninformed L-block decoder and to maximal error. 

Theorem [3] shows that the rate triples of C can be achieved in the totally asynchronous case considering uninformed 
L = 5-block decoder with maximal error. It follows that in the partially asynchronous three senders case, C is also 
achievable considering uninformed L = 5-block decoder with maximal error with the same coding/decoding method. 
By Remark 15 it is enough to consider points which are not in C but can be written as the convex combination of two 
or three rate triples from C whose corresponding sets TZ [W;pxi x px2 x pxs] have the same third distribution. Note 
that the following part of this proof shows that in the latter cases L — 4-block decoder suffices. For the sake of clarity 
we deal only with points which can be written as the convex combination of two rate triples whose corresponding sets 
TZ [W;pxi X PX2 X PX3] have the same third distribution. The case of convex combination of three rate triples can be 
derived similarly. 

Let i?2, ^3) be in D{TZ [W;pxi x px2 x PX3]) and (^1, ^2,-^3) in D{TZ [W;px^ x Px^ x pxs])- Note that the 
third input distribution is the same in case of both convex polytopes. We want to show that a{Ri, R2, R3) + (1 — 
a){Ri, i?2, ^3) can be achieved in the partially asynchronous three senders case, a & (0, 1). Using LemmaUJit can be 
assumed that {Ri, R2, R3) and {Ri, R2, R3) lie on edges of same type. Without loss of generality it can be assumed 
that this common type is S" = {1, 3}. 

Let W' and tt be the 4-senders channel and the ordering in Lemma [s] for S = {1,3}. From the proof of Lemma [s] 
it can be seen that W' is the first sender split version of W and tt = (2, la, 3, 16). As a consequence of Lemma [S] there 
exist Ria,R2a with Ria + i?2a = ^1, Ria, R2a with Ria + R2a = ^1 and input distributions px^^,PXih,Px^^,Px^^ 

such that i?2, ^3) and {Ria, Ru, R2, R3) are those vertices of D{TZ\W' ■,pxi„, ^ PXit x pxa x px.,]) and 

D{TZ [W';Px_^ X Px_^^ x pj^^ x pxa]) respectively, which can be described by ordering tt. 

If a sender is split, then the delays of the two virtual senders are equal to the delay of the original sender. Hence 
it is enough to prove that a{Ria, Ru, R2, R3) + (1 — a){Ria, Rib, R2, R3) can be achieved for channel W' when the 
delay system is the following: Dia{n) — Dii,{n) — D2{n) and D3{n) are independent and uniformly distributed on the 
set {0, 1, . . . , n — 1}. 

Note that the coordinates of the 4-tuple {Ria, Rib, R2, R3) can be described as follows: R2 — I{X2 A Y),Ria = 
I{Xia A Y\X2), R3 = I{X3 A Y\Xia,X2) and R^ = I{Xu A F X2, X3), where the joint distribution of {Xia 
, Xib , X2 , X3, Y) is determined by the product input distribution px^^ x pxi^ x px^ x px-^ and the channel 
transition W . Similarly the coordinates of the 4-tuple {Ria, Ru, R2, R3) can be described by the equations: R2 — 
I{X2 A Y),Ria = I{Xia A Y\X2), R3 = HX3 A Y\Xia,X2) and Ru = I{Xu A Y\Xia, X2, X3), where the joint 
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distribution of {Xia, Xn,, X2, X3, Y) is determined by the product input distribution p-^^ x Px^^ x p^^ x px.^ and the 
channel transition W' . 

Random coding argument is used, assuming without any loss of generality that an and (1 — a)n are integers. The 
symbols of the random code-books are independent but not identically distributed random variables. The codewords 
of the virtual senders la, lb, and sender 2 consist of two parts. The first an symbols have distributions PXiaiPXu ^'^d 
PX2 respectively, while the last (1 — a)n symbols have distributions Px^ iPxib ^^'^ Px^ respectively. The symbols of 
codewords of sender 3 are identically distributed according to the distribution px^ ■ We show that with this code-book 
structure it is possible to achieve the rate tuple a{Ria, Ru, R2, R3) + (1 — ct){Ria, Rib, R2, R3), by successive decoding 
with ordering (2, la, 3, lb) for channel W' if senders la, 16, 2 are synchronized but sender 3 is not synchronized with 
them. 

Note that we do not assume that the receiver knows the delays. 

First the receiver decodes the codewords of sender 2. The situation is now more complicated than in case of 
identically distributed symbols. From the receiver's point of view the codewords of the second sender go through two 
different channels according to the different symbols of the codewords of the other senders. From the fact that the 
senders la, 16, 2 are synchronized the receiver knows that the first an consecutive symbols of codewords go through 
the channel 

W^{y\x2)^ ^ ^ ^ PXiAxia)PXitixu)PX3{x3Wiy\xia,Xlb,X2,X3), (64) 

xia&Xi xit,eXi X'jex^ 

and the last (1 — a)n consecutive symbols of codewords go through the channel 

VV'^{y\x2)^ X! X! ^ PXiA^ia)PXiS^u)PxA^3)W{y\xia,Xib,X2,X3). (65) 

xia&Xi xit,eXi X'jex^ 

The decoder does the following. As in Theorem [2] the n tuples (F_„+i, . . . Yq), . . . , {Yq, . . . l^-i) are examined. The 
receiver decodes the s-th codeword as the 0-th message of sender 2 if there exists an n tuple of examined output 
(Y^n+i, ■ ■ ■ ^i-i) such that the first an symbols of the s-th codewords are jointly typical with the first an symbols of 
{Y_n+i, ■ ■ ■ Yi^i) and the same is true for the last (1 — a)n symbols according to channels and respectively, 
and there are no other codewords with this property. With the shifted versions of this decoding technique the receiver 
also decodes the —2, —1, 1, 2-th messages of sender 2 to ensure the decoding of the O'th message of sender 16 in the 
last successive step. Note also that implicitly the receiver learns the delay of sender 2 (See Remark [8|. 

In the following successive step the receiver decodes the — 2, — 1, 0, 1, 2-th codewords of sender la considering 
typicality according to the channels 

W^°-{y,X2\xia) = ^ ^ PXi,{xib)pX2{x2)PX3{x3)W'{y\xia,Xib,X2,X3) (66) 

xib&Xi xseXs 

and 

W^''{y,X2\xia) ^ ^ ^ Px-^Axib)Px2{x2)PXs{x3W{y\xia,Xib,X2,X3). (67) 

Xib&Xi X3£X3 

In the third successive step the decoder deals with sender 3. Note that sender 3 is not synchronized with senders 
la, 16, 2, hence using the -2,-1,0,1,2'th codewords of the senders 2 and la the receiver can decode (surely) just the 
— 1, 0, 1-th codewords of sender 3. It is also true in this case that the symbols of the codewords of sender 3 go through 
two different channels: 

W^{y,Xia,X2\x3) = ^ PXi^{xia)PXiJxib)pX2{x2)W'{y\xia,Xib,X2,X3) (68) 
xib&Xi 

and 

W^{y,Xla,X2\x3) ^ ^ Px^Jxia)Px^Axib)PxAx2)W'{y\xia,Xib,X2,X3). (69) 
Xit,£Xi 

But there is an essential difference: due to the assumption on the delays it is not known which part of the codewords 
goes through the channel W'^, it can be any an consecutive symbols of the codewords. Here the word consecutive 
is understood modulo n. When the receiver is looking for typicality, 71^ joint typicality examinations are necessary 
according to the n possible positions of the separating line of the two possible channels and the possible codeword 
positions. 

In the final successive step the receiver decodes the 0-th codeword of sender 16 considering typicality according to 
the channels 

W^''{y, Xla, X2,X3\xib) PXi^ (xia)PX2 {X2)PX3 {X3W {y\xia, Xib, X2, X3) (70) 

and 

W^''{y, Xla, X2,X3\xib) = Px^^ {xia)Px^ {x2)Px^ {X3)W' {y\xia, Xib, X2,X3). (71) 

Following the calculation method of Theorem[2]it can be calculated that the rate tuple a{Ria, Ru, R2, R3) + (1 ~ 
a){Ria, Rib, R2, R3) is achievable with this method. Note that a genie added version of the model is necessary to a 
fully rigorous error estimation, as in [7 . It is also crucial that the number of joint typicality examinations is polynomial 
in n. One part of the complete calculation can be found in Appendix C. ■ 
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7 Summary 



This paper provides a general converse for the asynchronous multiple access channels which depends on the distribution 
of the delays. Interesting examples of capacity regions between the simple union and its convex closure are given. 
These previously unknown capacity regions are: when the set of possible delays are restricted to the even numbers, and 
when 2 out of 3 senders are synchronized but the third is not. These examples show, that the theory of asynchronous 
systems is far from complete. 

It would be interesting to define a modified model where there are pauses in the communication and hence the 
delays could change. Further it would be important to model practical delay systems, and analyze systems with 
several groups of senders where each group is synchronized. Moreover, it would be interesting to solve the open 
problem proposed in [6j in which the possible delays are in the set {0, 1, . . . , an} for some a G (0, 1). 



8 Appendix A 

The coding theorem for totally asynchronous MAC (with two senders) was first stated in [T?|, for the case when receiver 
knows the delays. The theorem was stated for maximal error but the converse actually proved even for average error. 
While the paper |14| is hard to read, with the help of the reviewers of the previous version of this paper we have 



checked that the converse proof is correct, up to a minor gap pointed out after eq. ( 19 ). The achievability proof in |14| 
is not addressed here since more accessible proofs have been published since then. In [10^ the same capacity region 
was claimed to be achievable also when the receiver was uninformed. However, in the delay-detection part of the proof 
in Appendix 1 of [10' there is a gap, in eq. (12b) an independence is assumed that need not hold when the examined 
n-block of channel input symbols consists of two parts, a codeword part and a sync sequence part. The other part of 
the proof addresses decoding the sent codewords when the delay is already known. This part is correct, giving rise to 
a valid achievability proof in the case of informed receiver. Another such proof was given in [7] via rate splitting and 
successive decoding, for any number of senders. Achievability in the uninformed receiver case has not been revisited 
until recently. The error in [TD] mentioned above was corrected in [B . Our approach to delay detection differs from 
that of |10j and [6j not so much in the latter employing a sync sequence (the first version of [5^ also did that) but 
rather in our using a technique from [H] to bound the probability of delay detection error, while in [5j a different clever 
method is used to fill the gap in |10| . 



9 Appendix B - Proof of Theorem [T] 



Let S C [K]. We will derive a bound for R{S). As in Section 3, take a window of the receiver consisting of + 1 
n-length blocks Y'^"'"-'- = {Yojilj • ■ • ; i^n(7v+i)-i} £^nd the codewords having index between 1 and N from all senders 
(they are fully covered by this window). Recall that D denotes the delay vector and 'Kbs+Db denotes the random 
vector with components X/^i+D, , I E B where B C [K]. Denote by X'*™ the 2K input codewords which overlap with 
the beginning and end of Y^+^. Then 

NnR{S) = (72) 
= H(M^) (73) 
= I(M^ AM^)+ 

+ H(M^|M^) (74) 
<I(M^ AM^) +iVne„ (75) 
< I(Xf A Y^+i , X'''" , D) + Nnsn (76) 

= }i{X^\T>)-}l{X'^\Y^+\-D)+}i(X^\Y^+\-D)- 

-H(X^|X'"^,Y^+\D) +Arn£„ (77) 
< H(X^ |X^. , D) - H(X^ I Y^+\ X^. , D) + 

+ I(X™ AX§'|Y^+\D) + Arne„ (78) 
= I(X^ AY^+'^\X^o,'D) + Kn\og\X\ + Nne^ (79) 
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= H(Y^+i |X^. , D) - H(Y^+i |X^, X^. , D)- 

+ Kn log \X\+ NnSn 
= H(Y^+i|Xg'e,D) +ii'n log I A-] +iVne„- 

N n-1 

-EE H(y„,+.|Yr+''-\x^,x^.,D) 

<H(Y^+i|X^.,D)- 

- E E '^{^nj+i\^\K],nj+i+DiK]^'^)~^ 

3 = 1 ^=Q 

+ /'i:nlog|A'| +Nn£n. 



(80) 
(81) 



(82) 



Now introduce the a random variable Yi linked to the random variables Xi^i(Q]j-^,X2^i^D2T ■ ■ i^K,i®DK by the 
channel W for all i € {0, 1, . . . , n — 1}. Then (82 1 is continued as 



n-1 



< E [(^ - 1) H(f,|Xsc,,eD,c , D) + X H(y,)- 



i=0 

Afn+n-l 



i=0 



+ H(F,)-(iV-l)H(f,|X[^],,eDj^,,,D) 

i=Arn 

+ ifn log I A"! + Arn£„ 

n-1 

<{N - 1) X I(X5,.eDs A fQ|Xsc,,eDsc |D)+ 

j=0 



(83) 



(84) 



+ 2n log 13^1 +Kn\og\X\ + NnSn 
Dividing by Nn and going with N to infinity give 

R{S) < l(Xs,Q®Ds A YglXse^QeDso, Q, D) + En- 

This proves Theorem [T] 

10 Appendix C - Some calculations to Theorem [5] 

The coding/decoding task of sender 3: the random code-book of the third sender consists of i.i.d symbols with 
distribution px,- This code-book contains 2(""-f(^3^'*'l-^i».^^)+(i-")^('^='^'^l^i-^2))-'^' codewords, an consecutive 
symbols of an input codeword go through channel W^, while (1 — a)n consecutive symbols of the codeword go through 
channel W^. Here 'consecutive' is understood modulo n. Let us denote by T C {0, . . . , n — 1} those indices when 
was used. T will be called separating pattern. Note that |r| = an and T contains consecutive numbers. The 
separating pattern depends on the relative delay D between the synchronized senders la, lb, 2 and the unsynchronized 
sender 3. The decoder sees an output flow (note that the symbols of senders la, 2 are also the part of the output). 
The decoder does not know where the codewords are separated, and does not know the separating pattern of the two 
possible channels. Hence the decoder should check the same output n tuples as the decoder of Section 2 when looking 
for joint typicality, but when it examines an output n tuple the decoder should check every possible separating 
pattern. We say that the s'th codeword is typical in window F" relative to separating pattern T if parts of the 
codewords consisting of the T coordinates of X"(s) and are jointly typical according to channel W^, and the same 
is true for the coordinates T'^ — {0, . . . , n — 1} \ T according to channel W^. If s is the only codeword which is typical 
in all the examined output windows relative to all separation patterns, then the decoder's estimation is s for the O'th 
message. Let us consider first the case when the examined output window is an output of the r'th codeword and when 
the real separating pattern is T. Then in this window the r'th codeword will be typical relative to T with probability 
exponentially close to 1 by classical arguments. First we show that no other codewords will be typical in this window. 
Let T be any separation pattern (it can be T too). We will estimate the probability that the s ^ r'th codeword will 
be typical in this window relative to T . 

For any separation patter T let P^^ (x", y") be the joint distribution on X^"^ ' x 3^'^ ' induced by the |T |-th power 
of pxs and by the memoryless channel W'^. Let be the marginal of P^^ on 3^'-^ L Similarly let (a;", y") be the 
joint distribution on X^^ ' x 3^1^ I induced by the |T |-th power of px^ and by the memoryless channel W^. Let q]^ 
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be the marginal of P^^ on 3^1^ L Furthermore, if x is an n-length sequence, then will denote the vector of length 
\T I consisting of those coordinates of x which are in |T |. 

Pwd {(^i(s), . . . ,x„(,s), Fi, . y„) G sUt')} 

(x''(a),y")eSj(T') 

• PWd {{Yi, ...,Yn)= y"|(Xi(s), . . . = x"(s)} (85) 



ql y (y 



(x''W,y")eS^(T') 

PWd {{Yi, . . . = y"|(Xi(s), . . .,Xn{s)) = x"(s)} (86) 



< ^ 2-"«(^(^3Ar|Xi<„X2)-5) 

(x"(s),y")65^(T') 
. 2-n(l-a)(-f(^3AF|Xi„,X2)-<5) 

pX(x^'(.),y^')Pj;(x^^.),y^'°) 

«£(y^')C(y^") 

• Prcond {{Yi, . . . ,y„) = y"|(^i(s), . . . ,X„(s)) = x"(s)} (87) 

^ 2-na{I{X3AY\Xi^,X2)-S) 
, 2-n(l-a)(-f(^3AF|Xi„,X2)-<5) 

^ ^X(x^'(^)|y^')C(x^^«)|y^") 

(x"(s),y")eS^(T') 

• PWd {(Fi, . . . , y„) = y"|(Xi(s), . . . ,X„(s)) = x"(s)} (88) 

Note that {Xi{s), . . . ,X„(s)) is independent from the output window (s ^ r), hence 1 is an upper bound of the 
inner sum. Note also that if one should expand Prcond {(^i, • • • ,Yn) = y"'|(Xi(s), . . . ,X„(s)) = x"(s)} then the real 
separation pattern T should be considered. If the examined output window is related to two codewords (the r'th and 
the rth codewords, where r ^ I) then the same argument works if s 7^ / and s ^ r. If one of them is equal to s then 
Gray's summing technique works, the derivations can be done as in Section 4a. 
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